Postfiltering Techniques in Low Bit - Rate Speech
نویسنده
چکیده
Postfilters are used in speech decoders to improve speech quality by preserving formant information and reducing noise in the valley regions. In this thesis, a new adaptive least-squares LPC-based time-domain postfilter is presented to overcome problems presented in the conventional LPC-based time-domain postfilter. Conventional LPC-based time-domain postfilter [4] produces an unpredictable spectral tilt that is hard to control by the modified LPC synthesis, inverse, and high pass filtering, causing unnecessary attenuation or amplification of some frequency components that introduces muffling in speech quality. This effect increases when voice coders are tandemed together. However, the least-squares postfilter solves these problems by eliminating the problem of spectral tilt in the conventional time-domain postfilter. The least-squares postfilter has a flat frequency response at formant peaks of the speech spectrum. Instead of looking at the modified LPC synthesis, inverse, and high pass filtering as in the conventional time-domain technique, a formant and null simultaneous tracking technique is adopted by taking advantage of a strong correlation between formants and poles in the LPC envelope. The least-squares postfilter has been used in the 4 kb/s Harmonic Excitation Linear Predictive Coder (HE-LPC) and subjective listening tests indicate that the new postfiltering technique outperforms the conventional one in both one and two tandem connections. Thesis Supervisor: Dr. Suat Yeldener Title: Scientist, Voiceband Processing Department, Comsat Laboratories Thesis Supervisor: Dr. Thomas F. Quatieri Title: Senior Member of the Technical Staff, MIT Lincoln Laboratory
منابع مشابه
Perceptual Postfiltering for Low Bit Rate Speech Coders
Adaptive postfiltering has become a common part of speech coding standards based on the Linear Prediction Analysis-by-Synthesis algorithm to decrease audible coding noise. However, a conventional adaptive postfilter is based on empirical assumptions of masking phenomena, which sometimes makes it hard to balance between noise reduction and speech distortion. This thesis introduces a novel percep...
متن کاملPerceptual Evaluation of a New Subband Low Bit Rate Speech Compression System based on Waveform Vector Quantization and SVD Postfiltering
This paper proposes a new low rate speech coding algorithm, based on a subband approach. At first, a frame of the incoming signal is fed to a low pass filter, thus yielding the low frequency (LF) part. By subtracting the latter from the incoming signal the high frequency (HF), non-smoothed part is obtained. The HF part is modeled using waveform vector quantisation (VQ), while the LF part is mod...
متن کاملStrategies to improve the performance of very low bit rate speech coders and application to a variable rate 1.2 kb/s codec - Vision, Image and Signal Processing, IEE Proceedings-
This paper presents several strategies to improve the performance of very low bit rate speech coders and describes a speech codec that incorporates these strategies and operates at an average bit rate of 1.2 kb/s. The encoding algorithm is based on several improvements in a mixed multiband excitation (MMBE) linear predictive coding (LPC) structure. A switched-predictive vector quantiser techniq...
متن کاملAn Enhanced Low Bit Rate Audio Codec Using Discrete Wavelet Transform
Audio coding is the technology to represent audio in digital form with as few bits as possible while maintaining the intelligibility and quality required for particular application. Interest in audio coding is motivated by the evolution to digital communications and the requirement to minimize bit rate, and hence conserve bandwidth. There is always a tradeoff between lowering the bit rate and m...
متن کاملA signal-separation-based array postfilter for distant speech recognition
In standard microphone array processing for distant speech recognition, the beamformed output is postfiltered to reduce residual noise. Postfiltering is usually performed through a Weiner filter whose parameters are estimated from both the beamformer output and the signals captured at the microphones themselves. Conventional postfiltering methods assume diffuse or incoherent noise at the variou...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2013